Gene length and codon usage bias in Drosophila melanogaster, Saccharomyces cerevisiae and Escherichia coli
نویسندگان
چکیده
The relationship between gene length and synonymous codon usage bias was investigated in Drosophila melanogaster, Escherichia coli and Saccharomyces cerevisiae. Simulation studies indicate that the correlations observed in the three organisms are unlikely to be due to sampling errors or any potential bias in the methods used to measure codon usage bias. The correlation was significantly positive in E.coli genes, whereas negative correlations were obtained for D. melanogaster and S.cerevisiae genes. When only ribosomal protein genes were used, whose expression levels are assumed to be similar, E.coli and S.cerevisiae showed significantly positive correlations. For the two eukaryotes, the distribution of effective number of codons was different in short genes (300-500 bp) compared with longer genes; this was not observed in E.coli. Both positive and negative correlations can be explained by translational selection. Energetically costly longer genes have higher codon usage bias to maximize translational efficiency. Selection may also be acting to reduce the size of highly expressed proteins, and the effect is particularly pronounced in eukaryotes. The different relationships between codon usage bias and gene length observed in prokaryotes and eukaryotes may be the consequence of these different types of selection.
منابع مشابه
Intragenic spatial patterns of codon usage bias in prokaryotic and eukaryotic genomes.
To study the roles of translational accuracy, translational efficiency, and the Hill-Robertson effect in codon usage bias, we studied the intragenic spatial distribution of synonymous codon usage bias in four prokaryotic (Escherichia coli, Bacillus subtilis, Sulfolobus tokodaii, and Thermotoga maritima) and two eukaryotic (Saccharomyces cerevisiae and Drosophila melanogaster) genomes. We genera...
متن کاملThe extent of gene essentiality and buffering by duplicates is not conserved across organisms Supplementary Material
Abbreviations: CAI – Codon Adaptation Index; D – effective gene family size (number of additional gene duplicates); E-value – expectation value; KD – knockdown; KO – knockout; MIPS Munich Information Center for Protein Sequences; P(S) – probability of survival upon singleor double-gene KO or KD; R – squared Pearson correlation coefficient; SGA– Synthetic Genetic Array; SSL – synthetic sick or l...
متن کاملCodon usage patterns in Escherichia coli, Bacillus subtilis, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster and Homo sapiens; a review of the considerable within-species diversity.
The genetic code is degenerate, but alternative synonymous codons are generally not used with equal frequency. Since the pioneering work of Grantham's group it has been apparent that genes from one species often share similarities in codon frequency; under the "genome hypothesis" there is a species-specific pattern to codon usage. However, it has become clear that in most species there are also...
متن کاملIdentification of Synonymous Codon Usage Bias in the Pseudorabies Virus UL31 Gene
Background: Little knowledge of synonymous codon usage pattern of pseudorabies virus (PRV) genome, especially the UL31 gene in the process for its evolution is available. Objectives: In the present study, the codon usage bias between PRV UL31 sequence and the UL31-like sequences was identified. Materials and Methods: We used a comprehensive analysi...
متن کاملConserved codon composition of ribosomal protein coding genes in Escherichia coli, Mycobacterium tuberculosis and Saccharomyces cerevisiae: lessons from supervised machine learning in functional genomics.
Genomics projects have resulted in a flood of sequence data. Functional annotation currently relies almost exclusively on inter-species sequence comparison and is restricted in cases of limited data from related species and widely divergent sequences with no known homologs. Here, we demonstrate that codon composition, a fusion of codon usage bias and amino acid composition signals, can accurate...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Nucleic acids research
دوره 26 13 شماره
صفحات -
تاریخ انتشار 1998